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Abstract. We present an algorithm for computing a maximum agreement subtree of two un- 
rooted evolutionary trees. It takes 0(n 1,5 log n) time for trees with unbounded degrees, matching the 
best known time complexity for the rooted case. Our algorithm allows the input trees to be mixed 
trees, i.e., trees that may contain directed and undirected edges at the same time. Our algorithm 
adopts a recursive strategy exploiting a technique called label compression. The backbone of this 
technique is an algorithm that computes the maximum weight matchings over many subgraphs of a 
bipartite graph as fast as it takes to compute a single matching. 

1. Introduction. An evolutionary tree is one whose leaves are labeled with 
distinct symbols representing species. Evolutionary trees are useful for modeling the 
evolutionary relationship of species |l],f|,||, [l6||l7|,|2jj. An agreement subtree of two 
evolutionary trees is an evolutionary tree that is also a topological subtree of the two 
given trees. A maximum agreement subtree is one with the largest possible number 
of leaves. Different models about the evolutionary relationship of the same species 
may result in different evolutionary trees. A fundamental problem in computational 
biology is to determine how much two models of evolution have in common. To 
a certain extent, this problem can be solved by computing a maximum agreement 
subtree of two given evolutionary trees |l2| . 

Algorithms for computing a maximum agreement subtree of two unrooted evo- 
lutionary trees as well as two rooted trees have been studied intensively in the past 
few years. The unrooted case is more difficult than the rooted case. There is indeed 
a linear-time reduction from the rooted case to the unrooted one, but the reverse is 
not known. Steel and Warnow pi| gave the first polynomial-time algorithm for un- 
rooted trees, which runs in 0(n 4 5 logn) time. Farach and Thorup reduced the time 
to 0{n 2+ °^) for unrooted trees (lO) and 0(n 15 logn) for rooted trees |Oj. For the 
unrooted case, the time was improved by Lam, Sung and Ting to 0(n 175+ °( 1 ^). 
Algorithms that work well for rooted trees with degrees bounded by a constant have 
also been revealed recently. The algorithm of Farach, Przytycka and Thorup |9| takes 
0(nlog 3 n) time, and that of Kao |2(| takes 0(n log 2 n) time. Cole and Hariharan |tJ 
gave an 0(nlogn)-time algorithm for the case where the input is further restricted 
to binary rooted trees. 

This paper presents an algorithm for computing a maximum agreement subtree 
of two unrooted trees. It takes 0(n 15 logn) time for trees with unbounded degrees, 
matching the best known time complexity for the rooted case jy} ■ If the degrees are 
bounded by a constant, the running time is only 0(nlog 4 n). We omit the details of 
this reduction since Przytycka |p3f recently devised an 0(n log n)-time algorithm for 
the same case. 
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Our algorithm allows the input trees to be mixed trees, i.e., trees that may con- 



tain directed and undirected edges at the same time [|15, 18 1. Such trees can handle a 
broader range of information than rooted and unrooted trees. To simplify the discus- 
sion, this paper focuses on unrooted trees. Our subtree algorithm adopts a concep- 
tually simple recursive strategy exploiting a novel technique called label compression. 
This technique enables our algorithm to process overlapping subtrees iteratively while 
keeping the total tree size very close to the original input size. Label compression 
builds on an unexpectedly fast algorithm for the all- cavity maximum weight matching 
problem pij , which asks for the weight of a maximum weight matching in G — {u} 
for each node u of a bipartite graph G with integer edge weights. If G has n nodes, 
m edges and maximum edge weight N, the algorithm takes 0(y/nm\og(nN)) time, 
which matches the best known time bound for computing a single maximum weight 
matching of G due to Gabow and Tarjan 

In we solve the all-cavity matching problem. In §||, we formally define maxi- 
mum agreement subtrees and outline our recursive strategy for computing them. We 
describe label compression in §|], detail our subtree algorithm in §|1 and discuss how 
to compute auxiliary information for label compression in §|^ and §0. We conclude by 
extending the subtree algorithm to mixed trees in §||. 

2. All-cavity maximum weight matching. Let G = (X,Y,E) be a bipartite 
graph with n nodes and m edges where each edge (u, v) has a positive integer weight 
w(u,v) < N. Let mwm(G) denote the weight of a maximum weight matching in G. 
The all-cavity matching problem asks for mwm(G — {u}) for all u € X U Y. A naive 
approach to solve this problem is to compute mwm(G — {u}) separately for each u 
using the fastest algorithm for computing a single maximum weight matching [ fl3|| , 
thus taking 0(n 15 m log(niV)) total time. A main finding of this paper is that the 
matchings in different subgraphs G — {u} are closely related and can be represented 
succinctly. From this representation, we can solve the problem in 0(y/nm log(niV)) 
time. By symmetry, we only detail how to compute mwm(G — {u}) for all u S X . 
Below we assume m > n/2; otherwise, we remove the degree-zero nodes and work on 
the smaller resultant graph. 

A node v of G is matched by a matching of G if v is an endpoint of an edge in 
the matching. In the remainder of this section, let M be a fixed maximum weight 
matching of G; also let w(H) be the total weight of a set H of edges. An alternating 
path is a simple path P in G such that (1) P starts with an edge in M, (2) the edges 
of P alternate between M and E — M, and (3) if P ends at an edge (u, v) £ M, then 
v is not matched by M. An alternating cycle is a simple cycle G in G whose edges 
alternate between M and E — M . P (respectively, G) can transform M to another 
matching M ' = P U M - P n M (respectively, CUM-Cfl M). The net change 
induced by P, denoted by A(P), is w(M') — w[M), i.e., the total weight of the edges 
of P in E — M minus that of the edges of P in M. The net change induced by G is 
defined similarly. 

The next lemma divides the computation of mwm(G — {u}) into two cases. 
Lemma 2.1. Let u € X. 

1. If u is not matched by M, then M is also a maximum weight matching in 
G — {u} and mwm(G — {u}) = mwm(G). 

2. If u is matched by M , then G contains an alternating path P starting from 
u, which can transform M to a maximum weight matching in G — {u}. 

Proof. Statement 1 is straightforward. To prove Statement 2, let M' be a maxi- 
mum weight matching in G — {u}. Consider the edges in M U M' — M PI M'. They 
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Fig. 2.1. (a) a bipartite graph G; (b) the corresponding directed graph D. 



form a set S of alternating paths and cycles. Since u is matched by M but not by 
M' , u is of degree one in M U M 1 — M n M'. Let P be the alternating path in 5 with 
u as an endpoint. Let M" be the matching obtained by transforming M only with P. 
Since u is not matched by M", M" is a matching in G — {u}. M' can be obtained by 
further transforming M" with the remaining alternating paths and cycles in S. The 
net change induced by each of these alternating paths and cycles is non-positive; oth- 
erwise, such a path or cycle can improve M and we obtain a contradiction. Therefore, 
w(M") > w(M'), i.e., both M' and M" are maximum weight matchings in G — {u}. 
□ 

By Lemma we can compute mwm(G — {u}) for any u G X matched by M 

by finding the alternating path starting from u with the largest net change. Below we 
construct a directed graph D, which enables us to identify such an alternating path 
for every node easily. The node set of D is X U Y U {t}, where t is a new node. The 
edge set of D is defined as follows; see Figure |2.1| for an example. 

• If x € X is not matched by M, D has an edge from x to t with weight zero. 

• If y S Y is matched by M, D has an edge from y to t with weight zero. 

• If M has an edge (x, y) where x S X and y G Y, D has an edge from x to y 
with weight — w{x,y). 

• li E — M has an edge (x, y) where a; G X and y E Y, D has an edge from y 
to a; with weight w(x,y). 

Note that D has n + 1 nodes and at most n + m edges. The weight of each edge 
in D is an integer in [-N, N]. 
Lemma 2.2. 

1. D contains no positive- weight cycle. 

2. Each alternating path P in G that starts from u G X corresponds to a simple 
path Q in D from u to t, and vice versa. Also, A(P) = w(Q). 

3. For each u G X matched by M , mwm(G — { u }) * s the sum o/mwm(G) and 
the weight of the longest path in D from u to t. 

Proof. 

Statement 1. Consider a simple cycle G = u\, u^, • • • , u&, u\ in D. Since t has no 
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outgoing edges, no Uj equals t. By the definition of D, C is also an alternating cycle 
in G. Therefore, w(C) is the net change induced by transforming M with G. Since 
M is a maximum weight matching in G, this net change is non-positive. 

Statement 2. Consider an alternating path P — u, U\, 112, ■ ■ ■ , Uk in G starting 
from u. In D, P is also a simple path. If Uk £ X, then Uk is not matched by M, and 
D contains the edge (u^, t). If Uk £ Y, then is matched by M, and £) again contains 
the edge (uk, t). Therefore, D contains the simple path Q — u, u±, U2, • • • yU-k, t. The 
weight of Q is A(P). The reverse direction of the statement is straightforward. 

Statement 3. This statement follows from Lemma 2T(||) and Statement [| □ 



Theorem 2.3. Given G , we can compute mwm(G — {u}) for all nodes u £ G in 
0{y/nm\og(nN)) time. 

Proof. By symmetry and Lemmas 2.1 (ll) and 2.2(|), we compute mwm(G- { u }) 



for all u £ X as follows. 

1. Compute a maximum weight matching M of G. 

2. Construct D as above and find the weights of its longest paths to t. 

3. For each u £ X, if u is matched by M, then mwm(G — {u}) is the sum of 
mwm(G) and the weight of the longest path from u to t in D; otherwise, 
mwm(G — {u}) = mwm(G). 

Step 1 takes 0(^/nm log(niV)) time. At Step 2, constructing D takes 0(n + m) time, 
and the single-destination longest paths problem takes 0(^/nm log N) time [Q. Step 
3 takes 0(n) time. Thus, the total time is 0(y / nm log(niV)). □ 

3. The main result. This section gives a formal definition of maximum agree- 
ment subtrees and an overview of our new subtree algorithm. 



3.1. Basics. Throughout the remainder of this paper, unrooted trees are de- 
noted by U or X, and rooted trees by T, W or R. A node of degree or 1 is a 
leaf] otherwise, it is internal. Adopted to avoid technical trivialities, this definition 
is somewhat nonstandard in that if the root of a rooted tree is of degree 1, it is also 
a leaf. 

For an unrooted tree U and a node u £ U, let U u denote the rooted tree con- 
structed by rooting U at u. For a rooted tree T and a node v £ T, let T v denote 
the rooted subtree of T that comprises v and its descendants. Similarly, for a node 
v £ U u , U uv is the rooted subtree of U u rooted at v, which is also called a rooted 
subtree of U. 

An evolutionary tree is a tree whose leaves are labeled with distinct symbols. 
Let T be a rooted evolutionary tree with leaves labeled over a set L. A label subset 
L' C L induces a subtree of T, denoted by T\L', whose nodes are the leaves of 
T labeled over L' as well as the least common ancestors of such leaves in T, and 
whose edges preserve the ancestor-descendant relationship of T. Consider two rooted 
evolutionary trees T\ and T2 labeled over L. Let T[ be a subtree of T\ induced by 
some subset of L. We similarly define T' 2 for T 2 . If there exists an isomorphism 
between T[ and T' 2 mapping each leaf in T[ to one in T' 2 with the same label, then T[ 
and T 2 are each called agreement subtrees of T\ and T 2 ■ Note that this isomorphism 
is unique. Consider any nodes u £ T% and v £ T 2 . We say that u is mapped to v in 
T[ and T 2 if this isomorphism maps u to v. A maximum agreement subtree of T\ and 
T 2 is one containing the largest possible number of labels. Let mast(Ti,T2) denote 
the number of labels in such a tree. A maximum agreement subtree of two unrooted 
evolutionary trees U\ and U2 is one with the largest number of labels among the 
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maximum agreement subtrees of and C/ 2 U over all nodes u G XJ\ and v £ U 2 . Let 



(3.1) mast(£7i, U 2 ) = max{mast(f/«, U%) \ueU%,ve U 2 }. 

Remark. The nodes u (or v) can be restricted to internal nodes when the trees 
have at least three nodes. We can also generalize the above definition to handle a 
pair of rooted tree and unrooted tree (T,U). That is, mast(T, U) is defined to be 
max{mast(T, U v ) | v G U}. 

3.2. Our subtree algorithm. The next theorem is our main result. The size 
\U\ (or |T|) of an unrooted tree U (or a rooted tree T) is its node count. 

Theorem 3.1. Let U\ and U 2 be two unrooted evolutionary trees. We can com- 
pute mast(f7i, U 2 ) in 0(N 1 ' 5 log N) time, where N = max{|(7i|, \U 2 \}. 

We prove Theorem |3.l| by presenting our algorithm in a top-down manner with 
an outline here. As in previous work, our algorithm only computes mast([/i, U 2 ) and 
can be augmented to report a corresponding subtree. It uses graph separators. A 
separator of a tree is an internal node whose removal divides the tree into connected 
components each containing at most half of the tree's nodes. Every tree that contains 
at least three nodes has a separator, which can be found in linear time. 



If Ui or U 2 has at most two nodes, mast(L/i, U 2 ) as defined in Equation (3.1) can 
easily be computed in O(N) time. Otherwise, both trees have at least three nodes 
each, and we can find a separator x of U\. We then consider three cases. 

Case 1: In some maximum agreement subtree of U\ and U 2 , the node x is 
mapped to a node y G U 2 - In this case, mast([/i, U 2 ) — mast({/f , U 2 ). To com- 
pute mast(t/f , U 2 ), we might simply evaluate mast([/f , £/f ) for different y in U 2 . This 
approach involves solving the mast problem for 9(A) different pairs of rooted trees 
and introduces much redundant computation. For example, consider a rooted subtree 
R of U 2 . For all y G U 2 — R, R is a common subtree of U\. Hence, R is examined 
repeatedly in the computation of mast(t/f , C/|) for these y. To speed up the computa- 
tion, we devise the technique of label compression in to elicit sufficient information 
between Z7f and R so that we can compute mast(f7f, t/f ) for all y G U 2 — R with- 
out examining R. This leads to an efficient algorithm for handling Case 1, the time 
complexity is stated in the following lemma. 

Lemma 3.2. Assume that Ui and U 2 have at least three nodes each. Given an 
internal node x G U%, we can compute mast(ff , U 2 ) in 0(N 15 \ogN) time. 

Proof. See §| to §@. □ 

Case 2: In some maximum agreement subtree of U\ and U 2 , two certain nodes 
vi and v 2 of Ui are mapped to nodes in U 2 , and x is on the path in U\ between v\ 
and v 2 . This case is similar to Case 1. Let U 2 be the tree constructed by adding a 
dummy node in the middle of every edge in U 2 . Then, mast([/i, U 2 ) — mast({/f , t/f ) 
for some dummy node y in U 2 . Thus, mast(/7i, U 2 ) = m&st(U?,U 2 ). As in Case 1, 
mast({/f , U 2 ) can be computed in 0(N 1 - 5 log AT) time. 

Case 3: None of the above two cases. Let U\,x, U% >2 , . . . , Ui t b be the evolutionary 
trees formed by the connected components of U\ — {x}. Let J\, ■ ■ ■ , Jb be the sets of 
labels in these components, respectively. Then, a maximum agreement subtree of U± 
and U 2 is labeled over some Ji. Therefore, mast(f/i, U 2 ) = max{mast(t r i ! i, U 2 \ Ji) \ 
i G [1, b]}, and we compute each mast(?7i i i, U 2 \ Ji) recursively. 

Figure [O] summarizes the steps for c omp uting mast(C/i, U 2 ). Here we analyze the 
time complexity T(N) based on Lemma |3.2[ Cases 1 and 2 each take 0(N 15 log A) 

5 



/* Ui and Ui are unrooted trees. */ 
mast([/i, U2) 

find a separator x of U\\ 

construct [/ 2 by adding a dummy node iu at the middle of each edge (u,v) in 
U 2 ; 

val = mast(L7f , U 2 )\ 
val' = mast(L7f , E/ 2 ); 

let l/i,!, tTj.,2) ■ ■ ■ 1 f 1,6 be the connected components of C/i — {x}; 
for all i € [1, 6], let Ji be the set of labels of Ui f, 
for all i S [1, 6], set vah = mast(J/i,j, [/ 2 | Ji)}', 
return maxjval, val , maxi<i<{, vah,}; 



Fig. 3.1. Algorithm for computing mast ( Ui, C/2). 



time. Let A^ = |t7i.i|. Then Case 3 takes b] T(Ni) time. By recursion, 

T(N) =0(N 1 - 5 logN)+ T ( N i)- 

ie[i,b] 

Since a; is a separator of U\, iVj < ^. Then, since 5Zie[i 6] N% < iV, T(N) = 
0(N 15 log AT) [pl |l9| and the time bound in Theorem [O] follows. To complete the 
proof of Theorem 3.1, we devote §|] through §[?] to proving Lemma 3.2. 



4. Label compressions. To compute a maximum agreement subtree, our algo- 
rithm recursively processes overlapping subtrees of the input trees. The technique of 
label compression compresses overlapping parts of such subtrees to reduce their total 



size. We define label compressions with respect to a rooted subtree in §4.1 and with 



respect to two label-disjoint rooted subtrees in §4.2. We do not use label compression 
with respect to three or more trees. 

As a warm-up, let us define a concept called subtree shrinking, which is a primitive 
form of label compression. Let T be a rooted tree. Let R be a rooted subtree of T. 
Let TQR denote the rooted tree obtained by replacing R with a leaf 7. We say that 
7 is a shrunk leaf. The other leaves are atomic leaves. Similarly, for two label-disjoint 
rooted subtrees R\ and R2 of T, let T0(i?i,i? 2 ) denote the rooted tree obtained by 
replacing Ri and R2 with shrunk leaves 71 and 72, respectively. We extend these 
notions to an unrooted tree U and define UQR and UQ(Ri, R2) similarly. 

4.1. Label compression with respect to one rooted subtree. Let T be a 

rooted tree. Let v be a node in T and u an ancestor of v. Let P be the path of T 
from u to v. A node lies between u and v if it is in P but differs from u and v. A 
subtree of T is attached to u if it is some T w where w is a child of u. A subtree of T 
hangs between u and v if it is attached to some node lying between u and v, but its 
root is not in P and is not v. 

We are now ready to define the concept of label compression. Let T and R be 
rooted evolutionary trees labeled over L and K, respectively. The compression of T 
with respect to R, denoted by T®R, is a tree constructed by affixing extra nodes to 



T\(L — K) with the following steps; see Figure |4- 1| for an example. Consider each 
node y in T\(L — K), let x be its parent in T\(L — K). 

• Let A(T, K, y) denote the set of subtrees of T that are attached to y and 
whose leaves are all labeled over K. If A(T, K, y) is non-empty, compress all 
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Fig. 4.1. An example of label compression. 



the trees in A(T, K, y) into a single node Z\ and attach it to y. 
• Let H(T, K, y) denote the set of subtrees of T that hang between x and y (by 
definition of T\{L — K), these subtrees are all labeled over K). If H(T, K, y) 
is non-empty, compress the parents p\ , . . . , p m of the roots of the trees in 
H(T, K, y) into a single node p\, and insert it between x and y; also compress 
all the trees in Tt(T, K, y) into a single node z 2 and attach it to p\. 
The nodes Zi, Z2 and p\ are called compressed nodes, and the leaves in T(&R that are 
not compressed are atomic leaves. 

We further store in T®R some auxiliary information about the relationship be- 
tween T and R. For an internal node v in T®R, let a(v) — mast(T 1 ', R). For a 
compressed leaf v in T(g)R, if it is compressed from a set of subtrees T Vl , . . . , T Vs , let 
a(v) =max{mast(T" 1 ,i?), . . . , mast(T% i?)}. 

Let Ti and T2 be two rooted evolutionary trees. Assume T 2 contains a rooted 
subtree R. Given Ti®R, we can compute mast(Ti,T2) without examining R. We 
first construct T%QR by replacing R of T2 with a shrunk leaf and then compute 
mast(Ti, T2) from Ti<g)i? and T2QR. To further our discussion, we next generalize the 
definition of maximum agreement subtree for a pair of trees that contain compressed 
leaves and a shrunk leaf, respectively. 

Let W\ = T^R and W 2 = T 2 QR. Let 7 be the shrunk leaf in W 2 . We define an 
agreement subtree of W\ and W 2 similar to that of ordinary evolutionary trees. An 
atomic leaf must still be mapped to an atomic leaf with the same label. However, the 
shrunk leaf 7 of W 2 can be mapped to any internal node or compressed leaf v of W\ 
as long as a[v) > 0. The size of an agreement subtree is the number of its atomic 
leaves, plus a(v) if 7 is mapped to a node v € W%. A maximum agreement subtree of 
W\ and W 2 is one with the largest size. Let mast(Wi, W 2 ) denote the size of such a 
subtree. The following lemma is the cornerstone of label compression. 
Lemma 4.1. mast(T 1 ,T 2 ) = mast(Wi, Wb). 
Proof. It follows directly from the definition. □ 

We can compute mast(Wi, W2) as if W\ and W 2 were ordinary rooted evolu- 
tionary trees j^,[TT],^(| with a special procedure on handling the shrunk leaf. The 
time complexity is stated in the following lemma. Let n = max{ | W\ \ , | W 2 \ } and 
A = max{|T 1 |,|T 2 |}. 

Lemma 4.2. Suppose that all the auxiliary information of W\ has been given. 
Then mast(Wi, W 2 ) can be computed in 0(n 15 \ogN) time and afterwards we can 
retrieve mast(W^, W 2 ) for any node v S W\ in 0(1) time. 

Proof. We adapt Farach and Thorup's rooted subtree algorithm |ll]] to compute 
mast(W / i, W 2 ). Details are given in §|X|. □ 
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We demonstrate a scenario where label compression speeds up the computation 



of mast(J7f , £/ 2 ) for Lemma 3.2. Suppose that we can identify a rooted subtree R of 



U 2 such that x is mapped to a node outside R, i.e., we can reduce Equation ( |3.1| ) to 

(4.1) mast(Z7f , U2) = max{mast(L T f, U%) \ y is an internal node not in R}. 

Note that every 11% contains R as a common subtree. To avoid overlapping compu- 
tation on R, we construct W = U?®R and X = U 2 QR. Then X y = U%QR and from 



Lemma LI, mast(£7f , V\ ) = mast(VL, X y ). We rewrite Equation (LI) as 



(4.2) mast(Z7f, C/ 2 ) = max{mast(W / , X 4 ') | y is an internal node of X}. 

If R is large, then W and X are much smaller than Uf and U^- Consequently, it is 
beneficial to compress f/f and compute mast(C/f , C/2) according to Equation ((L^). 

4.2. Label compression with respect to two rooted subtrees. Let T, 

i?2 be rooted evolutionary trees labeled over L, Ki, K2, respectively, where 
Ki n K% = 4>. Let K = K\ U K2. The compression of T with respect to i?i and R2, 
denoted by T<B>(Ri, -R2), is a tree constructed from T|(L — if) by the following two 
steps. For each node y and its parent x in T\(L — K), 

1. if -4(T, if, j/) is non-empty, compress all the trees in A(T, K, y) into a single 
leaf and attach it to y; create and attach an auxiliary node z to y, 

2. if 7i(T, K,y) is non-empty, compress the parents p±, . . ., p rn of the roots of 
the subtrees in H(T, K, y) into a single node p\ and insert it between x and 
y; compress the subtrees in TC(T, K,y) into a single node z and attach it to 
Px\ create and insert an auxiliary node p\ between pi and y; create auxiliary 
nodes z and z and attach them to p± and p± , respectively. 

The nodes p\ and z are compressed nodes of T<g>(Ri, R2). The nodes pi,z, and I 
are auxiliary nodes. These nodes are added to capture the topology of T that is 
isomorphic with the subtrees i?i and i?2 of T' . 

We also store auxiliary information in T®{R\, i?2)- Let R + be the tree obtained 
by connecting i?i and i?2 together with a node, which becomes the root of R + . 

Consider the internal nodes of T®(R\, R2). If v is an internal node inherited 
from T\(L - K), then let a x {v) = mast(T", Ri) and a 2 (v) = mast(T u , R 2 ). If pi 
and pi are internal nodes compressed from some path pi, . . . ,p m of T, then only p\ 
stores the values a x (pi) — mast(T Pl , Ri), a 2 (pi) — mast(T Pl , i? 2 ), and a+(pi) = 
mast^P 1 ,^). 

We do not store any auxiliary information at the atomic leaves in T(g>(R±, R2). 
Consider the other leaves in T<g>(i?i, i?2) based on how they are created. 

Case 1: Nodes z, z are leaves created with respect to A(T, K, y) for some node y 
in T\(L - K). Let A(T, K, y) = {T v \ . . . , T Vk }. We store the following values at z. 

• a x (z) = max{mast(T t '%i?i) | i G [l,k]}, a 2 (z) = max-jmast^, R 2 ) \ i € 
[l,fc]}, a+(z) = max{mast(r"S R+) \ i G [l,fc]}; 

• (3(z) = maxjmast^si?!) + mast(T"<', R 2 ) \ T v > and T v *' are distinct sub- 
trees in A(T, K,y)}. 

Case 2: Nodes z, z, and z are leaves created with respect to the subtrees in 
H(T, K, y) = {T v \. . . , T v "} for some node y in T\(L - K). We store the following 
values at z: 

• a 1 (z), a 2 (z), and a + {z) as in Case 1; 

• (3{z) = max{mast(T 1 '*, i?i)+mast(T^, i? 2 ) | T Vt and T v i are distinct subtrees 
in H(T, K, y) that are attached to the same node in T}; 



ft v2 (z) = max{mast(T^,i?i) + mast(TV , i? 2 ) | g and 

I3 2>1 (z) = max{mast(T%i? 2 ) + mast(TV j i? 1 ) j (j,/) e Z}, 
where Z = {(j,f) \ T% T v i' e Tt(T, K, y) and the parent of Vj in T is a 
proper ancestor of the parent of Uj' }. 
Let T\ and T2 be rooted evolutionary trees. Let R\ and R 2 be label-disjoint 
rooted subtrees of T 2 . Let Wi = T®(R 1 ,R 2 ) and W 2 = T l Q(R ll R 2 ). Below, we give 
the definition of a maximum agreement subtree of W\ and W 2 ■ 

Let 71 and 72 be the two shrunk leaves in W 2 representing R\ and R 2 , respectively. 
Let y c be the least common ancestor of 71 and 72 in W 2 . Intuitively, in a pair of 
agreement subtrees (W[,W 2 ) of W\ and W 2 , atomic leaves are mapped to atomic 
leaves, and shrunk leaves are mapped to internal nodes or leaves. Moreover, we allow 
W 2 to contain y c as a leaf, which can be mapped to an internal node or leaf of W{ ■ 
More formally, we require that there is an isomorphism between W[ and W 2 satisfying 
the following conditions: 

1. Every atomic leaf is mapped to an atomic leaf with the same label. 

2. If W 2 contains y c as a leaf and thus neither 71 nor 72 is found in W 2 , then y c 
is mapped to a node v with a + (v) > 0. 

3. If only one of 71 and 72 exists in W 2 , say 71, then it is mapped to a node v 
with a 1 (v) > 0. 

4. If both 71 and 72 exist in W 2 , then any of the following cases is permitted: 

• 71 and 72 are respectively mapped to a compressed leaf z and its sibling 
z in W{ with P(z) > 0. 

• 71 and 72 are respectively mapped to a compressed leaf z and the ac- 
companying auxiliary leaf z in W[ with /3 ly2 (z) > 0, or the leaves z and 
z in W{ with f3* yi (z) > 0. 

• 71 and 72 are respectively mapped to two leaves or internal nodes v and 
w with ai(v), a 2 (w) > 0. 

The way we measure the size of W{ and W 2 depends on their isomorphism. For 
example, if y c is mapped to some node v in W{, then the size is the total number 
of atomic leaves in W[ plus a + (v). More precisely, the size of W[ and W 2 is defined 
to be the total number of atomic leaves in W[ plus the corresponding a or f3 values 
depending on the isomorphism between W{ and W 2 . A maximum agreement subtree 
of W\ and W 2 is one with the largest possible size. Let mast(Wi, W 2 ) denote the size 



of such a subtree. The following lemma, like Lemma [Ll|, is also the cornerstone of 
label compression. 

Lemma 4.3. mast(Ti,T 2 ) = mast(Wi, W 2 ). 

Proof. It follows directly from the definition of mast(W r i, W 2 ). □ 

Again, mast(W r i, W 2 ) can be computed by adapting Farach and Thorup's rooted 
subtree algorithm Jll]]. The time complexity is stated in the following lemma. Let n 
= max{|Wi|,|W 2 |} and N = max{|Ti|, \T 2 \}. 

Lemma 4.4. Suppose that all the auxiliary information of W\ has been given. 
Then we can compute mast(I / Fi, W 2 ) in 0(n 15 logiV) time. Afterwards we can retrieve 
mast(VF^, W 2 ) for any v G W in 0(1) time. 

Proof. See §111. □ 



5. Computing mast({7^, U2) — Proof of Lemma |3.2| . At a high level, we 
first apply label compression to the input instance (Uf,U 2 ). We then reduce the 
problem to a number of smaller subproblems (W,X), each of which is similar to 
(ff , U 2 ) and is solved recursively. For each (W,X) generated, A is a subtree of U 2 
with at most two shrunk leaves, and W is a label compression of Uf with respect to 
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/* W is a rooted tree with compressed leaves. X is unrooted with shrunk leaves. */ 
mast(W,X) 

let y be a separator of X; 

val = mast(W,X y ); 

if (X has at most one shrunk leaf) or (y lies between the two shrunk leaves) 
then 

new_subproblem(M / , X, y); 
for each (Wi,Xi), vaL = mast(W / i, Xi); 
else 

let ?/ be the node on the path between the two shrunk leaves that is the closest to 
V\ 

val = mast(W,X H ); 
new_subproblem(W, X, y'); 
for each (Wi,Xi), set vaL = mast (Wi,Xi); 
return max{val, max' =1 vaL}; 



/* Generate new subproblems {(Wi, Xi), . . . , (W&, Xb)}- */ 
new_subproblem(M / , X, y) 

let vi, . . . , Vb be the neighbors of y in X; 

for all i e [1,6] 

let Xi be the unrooted tree formed by shrinking the subtree X ViV into a shrunk leaf; 
let Wi be the rooted tree formed by compressing W with respect to X ViV ; 
compute and store the auxiliary information in Wi for all i £ [1,6]; 

Fig. 5.1. Algorithm for computing mast(Vy, X). 



some rooted subtrees of U 2 that are represented by the shrunk leaves of X. Also, W 
and X contain the same number of atomic leaves. 

5.1. Recursive computation of mast(W, X). Our subtree algorithm initially 
sets W = Uf and X = U 2 . In general, W = Uf®R and X = U 2 eR, or W = 
U?®(R, R') and X = U 2 e(R, R') for some rooted subtrees R and R' of U 2 - If W or 
X has at most two nodes, then mast(W,X) can easily be computed in linear time. 
Otherwise, both W and X each have at least three nodes. Let N = max{|[/i|, \U 2 \] 
and n = maxjlVFl, \X\}. Our algorithm first finds a separator y of X and computes 
m&st(W,X) for the following two cases. The output is the larger of the two cases. 



Figure 5.1 outlines our algorithm. 

Case 1: mast(VF, X) = mast(ll / ", X y ). We root X at y and evaluate mast(W, X y ). 
By Lemma [Q| , this takes 0(n 15 logiV) time. 

Case 2: mast(W / , X) = ma.st(W, X z ) for some internal node z 7^ y. We compute 
max{mast(H /r , X z ) \ z is an internal node and z 7^ y} by solving a set of subproblems 
{mast(W / i, Xi), . . ., mast(W / b, Xb)} where their total size is n and max{mast(W / , X z ) \ 
z is an internal node and z =/= y} = max{mast ( Wi, Xi) \ i £ [l,b]}. Moreover, our 
algorithm enforces the following properties. 

• If X contains at most one shrunk leaf, every subproblem generated has size 
at most half that of X. 

• If X has two shrunk leaves, at most one subproblem (Wi o ,Xi o ) has size 
greater than half that of X, but Xi o contains only one shrunk leaf. Thus, in 
the next recursion level, every subproblem spawned by (Wi o ,Xi o ) has size at 
most half that of X. 
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To summarize, whenever the recursion gets down by two levels, the size of a subprob- 
lem reduces by half. 

The subproblems mast(Wi, -X'i), ■ ■ ., mast(Wi,, Xb) are formally defined as follows. 
Assume that the separator y has b neighbors in X, namely, Vi,...,Vf,. For each 
i G [1,6], let Cj be the connected component in X — {y} that contains V{. The size 
of Ci is at most half that of X. Intuitively, we would like to shrink the subtree X ViV 
into a leaf, producing a smaller unrooted tree Xi. We first consider the simple case 
where X has at most one shrunk leaf. Then no Ci contains more than one shrunk 
leaf. 

If Ci contains no shrunk leaf, then Xi contains only one shrunk leaf representing 
the subtree X ViV . Note that X ViV corresponds to the subtree U% tV in U 2 and Xj = 
U 2 eU^ y . Let Wi = Uf®U% iV , 

If Cj contains one shrunk leaf 71 then Xi contains 71 as well as a new shrunk 
leaf representing the subtrees X ViV . The two subtrees are label-disjoint. Again, X ViV 
corresponds to the subtree U% iV in U 2 . Assume that 71 corresponds to a subtree U% y 
in U 2 . Then X t = U 2 e(U% V , V? v ). Let Wi = Uf ®{U V 2 ' V ' , U? v ). 

We now consider the case where X itself already has two shrunk leaves 71 and 72 . 
If y lies on the path between 71 and 72, then no Ci contains more than one shrunk 
leaf and we define the smaller problem instances (Wi,Xi) as above. Otherwise, there 
is a Ci containing both 71 and 72. Xj as defined contains three compressed leaves, 
violating our requirement. In this case, we replace y with the node y' on the path 
between 71 and 72, which is the closest to y. Now, to compute mast(W / , X), we 
consider the two cases depending on whether the root of W is mapped to y' or not. 
Again, we first compute ma.st(W,X v ). Then, we define the connected components 
Ci and the smaller problem instances (Wj,Xj) with respect to y'. Every Xi has at 
most two compressed leaves, but y' may not be a separator and we cannot guarantee 
that the size of every subproblem is reduced by half. However, there can exist only 
one connected component d o with size larger than half that of A. Indeed, Ci o is the 
component containing y. In this case, both 71 and 72 are not inside Ci o , and Xi o as 
defined contains only one compressed leaf. Thus, the subproblems that mast(W / i , Xi ) 
spawns in the next recursion level each have size of at most half that of (W, X). 

With respect to y or y', computing the topology of all Xj and Wi from X and W 



is straightforward; see §5.2. Computing the auxiliary information in all Wi efficiently 



requires some intricate techniques, which are detailed in §g and §|7]. 

5.2. Computing the topology of compressed trees. The topology of Xi can 
be constructed from X by replacing the subtree X ViV of X with a shrunk leaf. Let J 
and Ji be the sets of labels in X and Xi, respectively. For the trees Wi, recall that 
the definitions of W and the trees Wi are based on affixing some nodes to the trees 
Uf\J and Uf\Ji, respectively. Observe that W\J and Uf\J have the same topology. 
Moreover, W\J % = (W\J)\J t and Uf\J t = (Z7f| J)|Jj. Thus, W\J % and Uf\ Ji have the 
same topology. We can obtain Uf | Ji by constructing W\Ji. Note that J = Ui<i<6 ^ 
and all the label sets Ji are disjoint. We can construct all the trees W\ Ji from W in 
0(n) time 0>|io|. Next, we show how to construct Wi from W\Ji in time linear in 
the size of W\Ji. We only detail the case where Xi consists of two shrunk leaves. The 
case for one shrunk leaf is similar. The following procedure is derived directly from 
the definition of the compression of Uf with respect to two subtrees. 

Let v be any node of W\ Ji. If v is not the root, let u be the parent of v in W\ Ji. 
• If A(Uf,L — Ji,v) is non-empty or equivalently the degree of v in Uf is 
different from its degree in W\ Ji, then attach auxiliary leaves z and z to v. 
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• If Ti(TJf , L— Ji, v) is non-empty or equivalently u is not the parent of v in J7f , 
then create a path between u and v consisting of two nodes p and p, attach 
auxiliary leaves z and z to p, and attach z to p. 

5.3. Time complexity of computing mast(W,X). 

Lemma 5.1. We can compute mast(W^X) in 0(n 15 \ogN) time. 

Proof. Let T(n) be the computation time of mast (IV, X). The computation is 
divided into two cases. Case 1 of § |5.l| takes 0(n 15 logiV) time. For Case 2, a set of 
subproblems {mast(Wi, Xi) \ i G [1,6]} are generated. As to be shown in §|| and §[?], 
the time to prepare all these subproblems is also 0(n 15 log AT). These subproblems, 
except possibly one, are each of size less than n/2. For the exceptional subproblem, 
say, ma.st(Wi, Xi), its computation is again divided into two cases. One case takes 
0(n 15 logiV) time. For the other case, another set of subproblems are generated in 
0(n 1 ' 5 logA r ) time. This time every such subproblem has size less than n/2. Let 
£ be the set of all the subproblems generated in both steps. The total size of the 
subproblems in £ is at most n, and 

T(n) ^Oin 1 - 5 log N)+ T (\ X '\)- 

mast(W,x')e£ 

It follows that T{n) = 0(n 15 logJV). □ 

By letting W = Uf and X = [7 2 , we have proved Lemma 3.2. What remains is 
to show how to compute the auxiliary information stored in all Wi from (W, X) in 
0(n 15 log N) time. Note that X contains at most two shrunk leaves. Depending on 
the number of shrunk leaves in X, we divide our discussion into §|^ and §|7| 

6. Auxiliary information for X with no shrunk leaf. The case of X con- 
taining no shrunk leaf occurs only when the algorithm starts, i.e., W = TJf , X = U2. 
and N — n. The subproblems mast(Wi, Xi), . . ., mast(W / 6, Xb) spawned from (W, X) 
are defined by an internal node j/ in X, which is adjacent to the nodes Vi,...,Vb. 
Let Ri and Ri denote the rooted subtrees X ViV and X yVi , respectively. Note that 
the rooted tree X v is composed of the subtrees R±, . . . , Rb- Also, Wi = W®Ri and 
Xi = WQRi. The total size of all Ri is at most n. Furthermore, each Ri is X v 



with Ri removed; see Figure 6.1. This section discusses how to compute the auxiliary 
information required by each Wi in 0(n 15 log N) time. 

6.1. Auxiliary information in the compressed leaves of Wi. Consider any 
compressed leaf v in Wi. Let S v denote the set of subtrees from which v is compressed. 
Then, the auxiliary information to be stored in v is 



(6.1) 



a(v) = max{mast(I^ z ,i? l ) | W z e S v }. 
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Observe that for any W z G S v , W z contains no labels outside Ri. So mast(VF z , Ri) — 
masi{W z , X y ) and we can rewrite Equation ( JTl| ) as 

a(v) = max{mast(W rz ,X 3/ ) \ W z G S v }. 

We use the rooted subtree algorithm of jllj to compute mast(VF, X y ) in 0(n 15 logiV) 
time. Then, we can retrieve the value of mast(M^ z , X v ) for any node z G W in O(l) 
time. To compute max{mast(PF z , X y ) \ W z G 5„} efficiently, we assume that for any 
node u G VF, the subtrees attached to it are numbered consecutively, starting from 1. 
We consider a preprocessing for efficient retrieval of the following types of values: 

• for some node u GW and some interval [a,b], max{mast(M /z , X y ) \ W z is a 
subtree attached to u and its number falls in [a, 6]}; 

• for some path P of W, max{mast(VF z , X y ) \ W z is a subtree attached to 
some node in P}. 

Lemma 6.1. Assume that we can retrieve mast(VF z , X v ) for any z G W in 0(1) 
time. Then we can preprocess W and X and construct additional data structures in 
0(n log* n) time so that any value of the above types can be retrieved in 0(1) time. 
Proof. We adapt preprocessing techniques for on-line product queries in || . □ 



With the preprocessing stated in Lemma 6.1, we can determine a(v) as follows. 
Note that S v is either a subset of the subtrees attached to a node u in W or the set 
of subtrees attached to nodes on a particular path in W. In the former case, u is 
also a parent of v and S v is partitioned into at most d u + 1 intervals where d u is the 



degree of u in Wi. From Lemma 6.1, a(v) can be found in 0(d u + 1) time. Similarly, 
for the latter case, a(v) can be found in O(l) time. Thus, the compressed leaves 
in Wi are processed in OdW 7 ^) time. Summing over all Wi, the time complexity is 
O(n). Therefore, the overall computation time for preprocessing and finding auxiliary 
information in the leaves of all Wi is 0(n 15 logiV). 

6.2. Auxiliary information in the internal nodes of Wj. Consider any 
internal node v in Wi with i G [1, b]. Our goal is to compute the auxiliary information 
a(v) = m&st(W v ,Ri). Note that Ri may be of size Q(n), and even computing one 
particular mast(lL™, Ri) already takes 0(n 15 log N) time. Fortunately, these Ri are 
very similar. Each Ri is X v with Ri removed. Exploiting this similarity and using 
the algorithm in §|| for all-cavity matchings, we can perform an 0(n 15 log A^)-time 
preprocessing so that we can retrieve mast(W v , Ri) for any internal node v in W and 
i G [1,6] in 0(log 2 n) time. Therefore, it takes OdW 7 ^ log 2 n) time to compute a(v) 
for all internal nodes v of one particular Wi, and 0(n log 2 n) time for all Wi. The 
0(n 15 log iV)-time preprocessing is detailed as follows. 

First, note that if we remove y from X v , the tree would decompose into the 
subtrees Ri, . . . ,Rb- Thus, the total size of all Ri is at most n. The next lemma 
suggests a way to retrieve efficiently ma.st(W v , Ri) and max{mast(M /l ', Rj) \ j G /} 
for any v G W and and / C [1,6]. 

Lemma 6.2. We can compute mast(M /r , Ri) for all i G [1, 6] in 0(n 15 logiV) time. 
Then, we can retrieve m&st{W v , Ri) for any node v in W and i G [1,6] in O(logn) 
time. Furthermore, we can build a data structure to retrieve max{mast(14 Al ', Rj) \ j G 
/} for any v G W and JC [1,6] in 0(log 2 n) time. 

Proof. This lemma follows from the rooted subtree algorithm and related data 
structures in □ 

Below, we give a formula to compute ma.st(W v , Ri) efficiently. For any z G W 
and i G [1,6], let r-mast(VF z , Ri) denote the maximum size among all the agreement 
subtrees of W z and Ri in which z is mapped to the root of Ri. 
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Lemma 6.3. 



m&st(W v , Ri) = max 



max{mast{W v , Rj) | j £ [1,6], j ^ i}; 
max{r-mast(W /z ,i? l ) | z € W v }. 



Proof. Observe that mast(W v , Ri) — mast(W /z , Ri) = r-m&st(W z , Ri) if in some 
maximum agreement subtree of W v and Ri, the root of Ri is mapped to some node z 
in W v . On the other hand, ma,st(W v , Ri) = mast(W v , Rj) for some j ^ i if in some 
maximum agreement subtree of W v and the root of Ri is not mapped to any node 
z in W v . □ 

By Lemma [T^, we decompose the computation of ma.st(W v ,Ri) into two parts. 
The value max{mast(VF 1 ', Rj) \ j € [l,b],j ^ i} is determined by answering two 
queries ma,x\mast(W v , Rj) \ j G [l,i — 1]} and max{mast(W v , Rj) | j G [z + 1, 6]} in 
0(log 2 n) time by Lemma The computation of maxjr-mast^VF 2 , Ri) \ z € W v } 
makes use of a maximum weight matching of some bipartite graph as follows. 

Let Ch(z) denote the set of children of a node z in a tree. Let G Zj i C Ch(z) x 
{Ri, ■ ■ ■ , Ri-i, Ri+i, ■ ■ • , Rb} be a bipartite graph where w € Ch(z) is connected to 
Rj if and only if mast(W w , Rj) > 0. Such an edge has weight mast(W™, Rj) < N. 

Fact 6.4 (see If the root of Ri is mapped to z in some maximum agreement 

subtree of W z and Ri, then a maximum weight matching of G z .i consists of at least 
two edges, and mwm(G z i) = r-mast(M^ z , Ri). 

Note that if a maximum weight matching of G z i consists of one edge, it corre- 
sponds to an agreement subtree of W z and Ri in which the root of Ri is not mapped 
to any node in W z . Thus, it is possible that mwm(G 2 i ) > r-mast(W /z , Ri). Never- 
theless, in this case we are no longer interested in the exact value of r-mast(Vt /z , Ri) 
since in a maximum agreement subtree of W z and Ri, the root of Ri is not mapped 



to any node in W z . In fact, Lemma 6.3 can be rewritten with the r-mast(W z , Ri) 



replaced by mwm(G Z j). Furthermore, since G z ,i, G z ,2, ■ ■ ■, G z ^ are very similar, the 
weights of a maximum weight matching cannot be all distinct. 

Lemma 6.5. At least b — d z o/mwm(G 2i i), mwm(G Z; 2)> ■■■> mwm (G Zi b) have 
the same value, where d z denotes the degree of z in W . 

Proof. Consider the bipartite graph K C Ch(z) x {R\, . . . ,Rb} in which a node 
w € Ch(z) is connected to Ri if and only if mast(W w , Ri) > 0. This edge is given a 
weight of ma,st(W w , Ri). Then, every G Zt i is a subgraph of K . Let M be a maximum 
weight matching of K. Observe that if an Ri is not adjacent to any edge in M , then 
M is also a maximum weight matching of G z ,i. Since M contains at most d z edges, 
there are at least b — d z trees Ri not adjacent to any edge in M and the corresponding 
mwm(G Z; i) have the same value. □ 

We next use 0(n 15 log N) time to find for all z in W, mwm(G Zi i), . . . , mwm(G Zj b). 
The results are to be stored in an array A z of dimension b for each node z, i.e., 
A z [i] = mwm(G Zi i). Note that if we represent each A z as an ordinary array, then 
filling these arrays entry by entry for all z € W would cost fi(6n) time. Nevertheless, 



by Lemma 3.5, most of the weights mwm(G Zj i) have the same value. Thus, we store 
these values in sparse arrays. Like an ordinary array, any entry in a sparse array A 
can be read and modified in O(l) time. In addition, we require that all the entries 
in A can be initialized to a fixed value in O(l) time and that all the distinct values 
stored in A can be retrieved in 0(m) time, where m denotes the number of distinct 
values in A. For an implementation of sparse array, see Exercise 2.12, page 71 of 
Before showing how to build these sparse arrays, we illustrate how they support 
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the computation of 



(6.2) max{mwm(G ZiJ ) | z E W v } = max{A z [i\ \ z E W v }. 

An efficient data structure for answering such a query is given in §[b]. Let m z be the 
number of distinct values in A z , and m = X)zew( TO £ ■'■)■ Let a ( n ) denote the inverse 
Ackermann function. Appendix |b] shows how to construct a data structure on top of 
the sparse arrays A z in 0(ma(|W|)) time such that we can retrieve for any v E W 



and i € [f , b] the value of max{A z [i] | z 6 W v } in 0(log |W|) time. From Lemma 6.5, 
m z < d z + 1 for all z E W; thus, m = 0(|W|). Therefore, the data structure can 
be built in time 0(ma(|W|)) = 0(|W|a(|W|)) = O(nlogn) and the retrieval time of 
Equation Q is O (log | W | ) = O (log n). 

To summarize, after building all the necessary data structures, we can retrieve 
max{mast(W" , Rj) \ j E [l,b],j ^= i} in 0(log 2 n) time and max{r-mast(VF z , R t ) \ 
z E W v } in O(logn) time. Hence, for any v E W and i E [1, b], mast(W v , Ri) can be 
computed in 0(log 2 n) time. 

To complete our discussion, we show below how to construct a sparse array A z 
or equivalently compute the weights {mwm(G 2j i) | i E [1,6]} efficiently. We cannot 
afford to examine every G Zi i and compute mwm(G Zi j) separately. Instead we build 
only one weighted graph G z C Ch(z) x {Ri, . . . , Rb} as follows. 

For a node z in W, the max-child z' of z is a child of z such that the subtree 
rooted at z' contains the maximum number of atomic leaves among all the subtrees 
attached to z. Let k(z) denote the total number of atomic leaves that are in W z but 
not in W z . The edges of G z are specified as follows. 

• For any non-max-child u of z, G z contains an edge between u and some R^ if 
and only if mast(Fy u , Ri) > 0. There are at most k(z) such edges. 

• Regarding the max-child z' of z, we only put into G z a limited number of 
edges between z' and {Ri, ■ ■ ■ ,Rb}- For each Ri already connected to some 
non-max-child of z, G z has an edge between z' and Ri if mast(VK z , Ri) > 
0. Among all other Ri, we pick Ri> and R411 such that mast(W" ,Rv) and 
mast(M /z , Ri") are the first and second largest. 

• Every edge (u, Ri) in G z is given a weight of ma,st(W u , Ri). 

Lemma 6.6. For all i E [1,6], mwm(G 2 - {Ri}) = Ewm(G Zil ). Furthermore, G z 
can be built in 0((k(z) + 1) log 2 n) time. 

Proof. The fact that mwm(G z - {Ri}) = mwm(G z ,j) follows from the construction 
of G z . Note that G z contains 0(k(z) + 1) edges. All edges in G z , except (z' , Ri>) and 
(z',Ri"), can be found using Q(k(z)) time. The weight of these edges can be found 



in 0(n(z)logn) time using Lemma 3J2. To identify (z',Ri>) and (z',Ri»), note that 
at most k(z) instances of Ri are connected to some non-max-child of z. All other Ri 
are part itio ned into at most k(z) + 1 intervals. For each interval, say I C [1,6], by 
Lemma the corresponding mast(W /,z , Ri) which attains the maximum in the set 
{mast(VF z ,Rj) \ j E 1} can be found in 0(log 2 n) time. Thus, by scanning all the 
k(z) + 1 intervals, R^ can be found in 0({n(z) + l)log 2 n) time. Ri" can be found 
similarly. □ 

Since G z contains 0(n(z) + l)edges, and each edge has weight at most N, we 



use the Gabow-Tarjan algorithm jl3[ to compute mwm(G 2 ) in 0{yj k(z) + 1(k(z) + 
l)logA r ) time. Then, using our algorithm for all-cavity maximum weight matching, 
we can compute mwm(G 2 — {Ri}) for all i E [1,6], and store the results in a sparse 
array A z in the same amount of time. 

Thus, all G z with z E W can be constructed in time X^zew 0(( K ( Z ) + 1) 1°§ 2 n ); 
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which is Oin lb log AT) as ^2 zeW n(z) = 0(n\ogn) |5). Given all G z , the time for 
computing A z for all z G W is 0(J2 z£W (Mz) + l) 1 - 5 log N). 
Lemma 6.7. E zG iv( K ( z ) + l) 15 log AT = 0(n 15 log AT). 

Proof. Let T(VF) = J2 z ew( K ( z ) + l) 1,S logiV. Let P be a path starting from 
the root of W such that every next node is the max-child of its predecessor. Then 
J2zeP K ( z ) — |W| — n - Let x{P) denote the set of subtrees attached to some node 
on P. The subtrees in x(P) are label-disjoint and each has size at most n/2. Thus, 

T(W) < («(*) + l) 1 - 5 log iV + T ( W ') 

zgp W'ex(P) 

<n 15 \ogN + T(W') 

w'ex(P) 
= 0(n 1,5 log AT). 



7. Auxiliary information for X with one or two shrunk leaves. 

7.1. X has one shrunk leaf. Consider the computation of mast(W, X). Ac- 
cording to the algorithm, mast(W,X) will spawn b subproblems mast(Wi, Xi),. . ., 
mast(W / f ) , Xb), which are defined by an internal node y in X adjacent to the nodes 
vi, . . . , Vb- Also, for every i £ [1,6], Ri and Ri denote the subtrees X ViV and X yVi , re- 
spectively. Suppose that X has one shrunk leaf and without loss of generality, assume 
that the shrunk leaf of X is in Rb, i.e., Xj, has two shrunk leaves and all the other Xi 
have one shrunk leaf each. This section shows how to find the auxiliary information 
required by Wj. , . . . , Wj in 0(n 15 log AT) time. 

Lemma 7.1. The auxiliary information required by W\, ■ ■ ■ , Wb-i can be computed 
in 0(n 15 log N) time. 

Proof. Note that mast(Wi, X\), . . . , mast(Wj,_i, X^-i) are almost identical to the 
subproblems considered in in that all the have exactly one shrunk leaf each. 
Using exactly the same approach, we can compute the auxiliary information in W±, 

W b -i. □ 

The remaining section focuses on computing the auxiliary information in Wb- Let 
71 and 72 be the two shrunk leaves of Xb- Assume that 71 is also a shrunk leaf in 
X, and 72 represents Rb- Let Q + be the subtree obtained by connecting 71 and Rb 
together with a node. To compute the auxiliary information in Wb, we require the 
values mast (W v , 71), mast(W™, Rb), and mast(W v ,Q + ) for all nodes v € W. These 
values are computed based on the following lemma. 

Lemma 7.2. mast (W v , 71), ma,st(W v , Rb), and mast(W™, Q + ) for all nodes v g 
W can be computed in 0(n log N) time. 

Proof. By Lemma 4.2, mast(W,Rb) and mast(M / , Q + ) can be computed in time 



0(n 15 log N) and afterwards, for each node v G W, mast(W"",Rb) and mast(W-™, Q + ) 
can be retrieved in O(l) time. For each node v € W, mast (W v , 71) is the auxiliary 
information stored at u in W and can be retrieved in 0(1) time. □ 

Now, we are ready to compute the auxiliary information stored at each n ode 
v 6 Wb- No auxiliary information is required for atomic leaves. Below, Lemma |7.3| 



and Lemma 1_A show that using 0(n) additional time, we can compute the auxiliary 
information in internal nodes and in compressed leaves, respectively. In summary, the 
auxiliary information in W%, . . . , Wb can be computed in 0(n 15 log N) time. 
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Lemma 7.3. Given mast (W v , 71) , mast(W v , R b ) , and mast(W, Q + ) for all 
nodes v G W, the auxiliary information stored at the internal nodes in W b can be 
found in 0(n) time. 

Proof. Let J b be the set of labels of the atomic leaves of W b . An internal node v 
can be either an auxiliary node, a compressed node, or a node of W\ Jb- If i>€ W\ Jb, 
then v eW. Thus, an(v) = mast (W", 71) and a 2 (v) = mast(W v , R b ). 

If v is a compressed node, then we need to compute cti(v), a 2 (v) and a + (v). 
Recall that v represents some tree path u = v\, . . . , Vk of W, where v\ is the closest 
to the root, i.e., v = v\. Thus, a\(v) — mast(W Vl , 71), a2(v) = ma,st(W Vl , Rb), and 
a+(v) = mast(W Vl ,Q+). 

Thus, 0(n) time is sufficient for finding the auxiliary information stored at every 
internal node of Wb- □ 

Lemma 7.4. Given mast(W, 71), mast(W v , R b ) , and mast(W, Q+) for all 
nodes v e W , the auxiliary information stored at the compressed leaves in Wb can 
be found in 0(n) time. 

Proof. If v is a compressed leaf in W, u's parent u must not be an auxiliary node. 
Depending on whether u is a compressed node, we have two cases. 

Case A: u is not a compressed node. We must compute a\{v), 0:2(1*) j a +{ v ), Pi v )- 
Note that u is also in W. When Wb is constructed from W, some of the subtrees 
of W attached to u are replaced by v and no longer exist in Wb- Let W Pl , . . . , W Pk 
be these subtrees. Observe that both v and W Pl , . . . , W Pk represent the same set of 
subtrees in T\. Thus, 

• ct\{v) = max{mast(W /Pi , 71) | 1 < i < k}; 

• a 2 (v) — max{mast(Fy p % i?b) | 1 < i < fc}; 

• a+(v) = max{mast(VK p ' , Q+) \ 1 < i < fc}; 

• 0(v) = max{mast(W^ , 71) + mast(W p i , R b ) \ 1 < i ^ j < k}. 

These four values can be found in 0(k) time. Since W Pl , . . . , W Pk are subtrees at- 
tached to u in W, k is at most the degree of u in W. Moreover, the sum of the 
degrees of all internal nodes of W is 0{n). Therefore, 0{n) time suffices to compute 
the auxiliary information for all the compressed leaves in Wb whose parents are not 
compressed node. 

Case B: u is a compressed node. We need to compute a\{v), a 2 {v), a + (v), f3(v), 
0iy2(v) and f3±y 2 (v). Note that u is compressed from a tree path pi, . . . ,pi~ in W 
where p\ is the closest to the root. Moreover, v is compressed from the subtrees 
hanging between pi and pu- For every i e [l,fc], let % be the set of subtrees of 
W attached to pi that are compressed into v. Both v and the subtrees in Ui<i<fc7i 
represent the same set of subtrees in T\. The auxiliary information stored at v can 
be expressed as follows. 

• Oi(i>) = max{mast(FT 9 , 71) | W q <G % for some i E [l,k]}. 

• oe 2 (v) = max{mast(W 9 , R b ) \ W q e % for some i e [1, k]}. 

• a+(v) = max{mast(IF«, Q+) \ W q € % for some i e [1, fc]}. 

• P(v) = maxi< 4 < fe [max{mast(VK«,7i) + m&st{W q ' ,R b ) j W q ,W q ' e %}]. 

• f3iy 2 {v) = maxi< :)<J < fe [max{mast(VT 9 ,7i) | W q e 7^}+max{mast(VK' z ', R b ) | 

W q 'eTj}]. 

• /3 2 yi(v) = maxi<j <i <fc[max{mast(lT /9 , R b ) \ W q G 7i}+max{mast(Fy ? ,71) | 

W q eTj}}. 

These values can be found in 0(^2 l<i<k d Pi ) time, where d Pi is the degree of pi in W. 
Thus, the auxiliary information for every compressed leaf of W b , whose parents are 
compressed nodes, can be computed in 0{n) time. □ 
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7.2. X has two shrunk leaves. Recall that the subproblems mast(Wi, X{), 
. . ., mast(Wb, Xb) are spawned from mast(W,X). This section considers the case 
where X has two shrunk leaves. Without loss of generality, assume that the two 
shrunk leaves are in Rb-i and Rb, respectively. Then, X±, . . ., Xb-i each have one 
shrunk leaf. Xb-\ and Xb each have two shrunk leaves. Below, we show how to 
compute the auxiliary informations of Wi, . . . , Wb in 0(n 15 \ogN) time. 

Lemma 7.5. The auxiliary information required by Wi, . . . , Wb-2 can be computed 
in 0(n 15 log N) time. 

Proof. The proof of this lemma is the same as that of Lemma [tt|. □ 

For the remaining subproblems mast(Wfc_i, Xb-i) and mast(Wb, Xb), both Xb-i 
and Xb have two shrunk leaves. By symmetry, it suffices to discuss the computation 
of m&st(Wb, Xb) only. Lemma [7.6| shows that the auxiliary information in Wb can be 



computed in 0(n \ogN) time. Therefore, the auxiliary information in Wi, . . . ,Wi 



b 

can be computed in 0(n 15 logiV) time. 

Lemma 7.6. The auxiliary information in Wb can be computed in 0(n 15 logN) 
time. 

Proof. Let 71 and 72 be the two shrunk leaves of Xb- Assume that 71 is also 
a shrunk leaf in X and 72 represents Rb, i.e., 72 represents the subtree U 2 bV of T\. 
Let Q + be th e su btre e ob tained by connecting 71 and Rb- By the same argument 
as in Lemma [7.3| and [7.4| , the auxiliary information in Wb can be computed based 
on the values mastfW", 71), mast(W v , R b ) and mast(W v , Q + ) for all v € W. The 
value mast(W v , 71) can be found in W. The values mast(W v , Rb) and rnast(W v , Q + ) 
for all v £ W can be retrieved in O(l) time after ma,st(W,Rb) and mast(W / , Q + ) are 



computed in 0(n 15 log N) time based on Lemma 4.4. Then the auxiliary information 
in Wb can be computed in 0(n) time. □ 

8. Extension. We have presented an 0(N 1 ' 5 log 7V)-time algorithm for comput- 
ing a maximum agreement subtree of two unrooted evolutionary trees of at most N 
nodes each. This algorithm can be modified slightly to compute a maximum agree- 
ment subtree for two mixed trees Mi and M2. 

For a mixed tree M, a node i is consistent with a node u if the directed edges on 
the path between u and t all point away from u. Let M u be the rooted tree constructed 
by assigning u in M as the root and removing the nodes of M inconsistent with u. 
Given two mixed tree M\ and Mi, we define a maximum agreement subtree of M\ and 
M2 to be the one with the largest number of labels among the maximum agreement 
subtree of M" and M| over all nodes u £ M\ and v £ M 2 . That is, 

mast(Mi,M 2 ) = max{mast(M 1 u , M% ) \ u £ M\,v £ M 2 }. 

As in the unrooted case, to compute mast(Mi,M 2 ), we find a separator y of 
Mi and compute mast(Mf,M 2 ). However, we need to delete the nodes of Mi not 
in M\. When computing mast(Mf, M 2 ), we construct some rooted subtrees of M 2 . 
Again, we delete the nodes of M 2 not in these rooted subtrees. Such deletions are 
straightforward and do not increase the time complexity of computing mast(Afi, M 2 ). 
Thus, mast(Mi,M2) can be computed in 0(N 15 log N) time. 

Acknowledgments. The authors thank the referees for helpful comments. 

Appendix A. Computing mast(Wi,W2). Let T\ and T 2 be rooted evo- 
lutionary trees. Let Ri and R 2 be two label-disjoint rooted subtrees of T 2 . Let 
Wi = Ti®(Ri,R 2 ) and W 2 = T 2 Q(R 1 ,R 2 ). This section shows that mast(W u Wg) 
can be computed as if Wi and W 2 were ordinary rooted evolutionary trees [pj|IT|,[20f| 
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with some special procedures on handling compressed and shrunk leaves. Note that 
the case where W\ and W% are compressed and shrunk with respect to a subtree can 
be treated as the special case where R\ is empty. 

Lemma A.l. We can compute mast(Wi, W2) in 0(n 15 log N) time, where n = 
m&x{\Wi\, I W2I} and N = max{|Ti|, |T 2 |}. Then, we can retrieve mast(W / 1 u , W2) for 
any node u ofW\ in 0(1) time. 

Proof. We adopt the framework of Farach and Thorup's algorithm pj , which 
is essentially a sparsified dynamic programming based on the following formula. For 
any internal nodes u of W\ and v of W% , 

{max{mast(M / f , W% ) | x is a child of u}; 
maxjmastCWf, W%) \ y is a child of u}; 
r-mast(VF 1 u , W^), 

where r-mast(W / ", ) denotes the maximum size of all the agreement subtrees of 
W" and W$ in which u is mapped to v. 

Our algorithm differs from Farach and Thorup's algorithm in the way how each 
individual mast (FT", W% ) is computed. When Wi and W2 are ordinary evolutionary 
trees, each mast(W", W$ ) is found by computing a maximum weight matching of some 
bipartite graph, and it takes 0(n 15 logn) time to compute mast(Wi, W^)- Below, we 
show that when Wi and W% have compressed and shrunk leaves, each mast(W", W% ) 
can be found either in constant time or by computing at most two maximum weight 
bipartite matchings of similar graphs but with edge weights bounded by N instead of 
n. Thus, we can compute mast(Wi, W2) using the same sparsified dynamic program- 
ming in JLT| |; as a by-product, we can afterwards retrieve mast(W r ", W%) for any node 
u of W\ in O(l) time. The enlarged upper bound of edge weights increases the time 
complexity to 0(n 15 log AT), though. 

In the rest of this section, we show how each mast(H / 1 tl , W% ) is computed. First, 
we consider the case when u is a leaf. The following case analysis shows that O(l) 
time suffices to compute mast(W", W% ). 

Case T. u is an atomic leaf. If contains a leaf with the same label as that of 
it, then mast(W", Wg) = 1; otherwise it equals zero. 

Case 2: u is an auxiliary leaf. Then, mast(W / 1 tl , W% ) = 0. 

Case 3: u is a compressed leaf. By definition, u can only be mapped to 71, 72 
or the least common ancestor y c of 71 and 72. If VF 2 has no shrunk leaves, then 
masttWf.WJ) = 0. If W$ has only one shrunk leaf, say 71, then mast(PF 1 u , W%) = 
a x {u). If W% has two shrunk leaves, W% must also contain y c and mast(W / ", W£ ) = 
max{o! 1 (it), cx 2 {u), a + (u)}. 

Next, we consider the case when u is an internal node. Assume that v is an atomic 
leaf. Then mast(W", W2 ) = 1 if contains a leaf with the same label as that of 
v, and zero otherwise. If v is a shrunk leaf, say 71, then mast(W", W% ) = oti(u). 
It remains to consider the case when v is an internal node. Due to the nature of 
dynamic programming, we only need to compute r-mast(W / 1 ", W% ), then we can apply 



the Equation (A.l) to compute mast(W / 1 u , W% )■ We further divide our discussion into 
the following three cases. 

Case 1: u is an auxiliary internal node. In such case, u has only two children, 
one of them is an auxiliary leaf. From definition, an auxiliary leaf will not be mapped 
to any node in any agreement subtree of W\ and W% ', thus, there is no agreement 
subtree in which u is mapped to v and r-mast(W^ W% ) = 0. 

Case 2: u is an ordinary internal node. As in we first construct the bipartite 
graph defined as follows: Let A and B be the set of children of u and v, respectively; 
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define G[A, B] to be the bipartite graph formed by the edges (x,y) € Ax B with 
mast(Wf , W%) > 0, and (a;, y) is given a weight mast(Wf , W 2 y ). 

If none of it's children is an auxiliary leaf, then r-mast(W / 1 u , W% ) = mwm(G[A, £?]). 
Otherwise, let z be the child of M which is an auxiliary. In this case, u also has a 
compressed child z. Other than z and z, no other child of u is a compressed and 
auxiliary leaf. On the other hand, consider the rooted subtrees W% rooted at the 
children x of v. If the shrunk leaves appear together in one of such subtrees, then by 
definition, z cannot be mapped to any shrunk leaf in any agreement subtree of 
and W%, and r-mast(W™, WJ) = mwm(GL4, B] ~ {z}). If the shrunk leaves appear 
in two different subtrees rooted at two children y\ and yi of v, then 

r-mast(W 1 u , W$) = max {mwm(G[A, B]-{z}),sxmm{G[A, B] — {z, z, yi , y 2 })+/3(z)\. 

Case 5: u is a compressed internal node. By definition of a compressed node, the 
structure of W" is very restrictive — u has exactly three children z, z, and an auxiliary 
internal node u; u has two children, an auxiliary leaf z and an uncompressed internal 



node w\ see Figure A.l. To find r-mast(W", W% ), we note that there are only six 
possible ways on how the z,z,z are mapped to 71 and 72. We consider each of these 
cases and r-mast(W r ", W£) is the maximum of the values found. We only discuss the 
case where 71 and 72 are mapped to z and z, respectively. The other cases can be 
handled similarly. Let P be the path between 72 and y c . Let S(P) denote the set of 
subtrees hanged on P. The size of the largest agreement subtrees of and W% in 
which 71 and 72 are mapped to z and z, respectively, equals 

(A.2) Piy2(z) + max{mast(WiV) | r e S(P)}. 

Note that using the technique in |lT[ , we can precompute max{mast(W r f , r) | r G 
5(P)} for all x e Wi in 0(n 15 log^) time. Afterwards, ( O ) can be found in 
constant time. □ 

Appendix B. Preprocessing for finding max{A z [z]|2: € W"}. 

Let h be the number of nodes in W. Consider the h arrays A z of dimension b where 
z G W. Recall that m z is the number of distinct values in A z , and m = X3 z£ ^(to 2 + 
1). This section describes an 0(ma(h))-tim.e preprocessing, which supports finding 
max{A 2 [i] I z G W v }, for any i G [1, b] and any node v of W, in 0(logh) time. 

By definition, each A z has at least b — m z entries storing some common value c z . 
For every i G [1,6], let 1^ be the set of nodes z where A z [i] stores a value different 
from c z . Note that m z = J2i<i<b ^ e assume that each node of W is identified 
uniquely by an integer in [1, h] assigned by a preorder tree traversal ||]. For any node 
v G W, let P(v) be the number of proper descendents of v in W. 
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Based on Lemma [B.l| , max{A z [i] | z £ W v } — max{A 2 [i] | z £ [v,v + P(v)]} 
for any v £ W, i £ [l,b]. Therefore, to solve our problem, it is sufficient to give an 
0(ma(h))-time preprocessing to support finding max{A z [i] | z £ H} for any i £ [1, b] 
and any interval H C [1, h] in 0(logh) time. 

Lemma B.l. For any v £ W and i £ [1,6], max-j^A^i] | z £ W v } — max{yl z [i] | 

Z £ [v, V + f3(v)]}. 

Proof. Straightforward. □ 

Our preprocessing does not work on each sequence Ai [i] , A% [i] , . . . , Ah [i] directly. 
Instead, it first draws out useful information about the common values c z stored in 
the sequences and applies a contraction technique to shorten each sequence. Then, it 



executes Fact B.2 on these shortened sequences. 

Fact B.2 (see ||). Given any sequence a\,...,ah of real numbers, we can 
preprocess these h numbers in 0(h) time so that we can find the maximum of any 
subsequence a x , a x +i, . . . , a y in 0(a{h)) time. 

Our preprocessing is detailed as follows. Its time complexity is 0{ma{h)) as 
shown in Lemma |B.3| . 

1. For each i £ [1,6], find Ti and arrange the integers in Ti in ascending order. 

2. Apply Fact to the sequence c%, . . . , 

3. For every non-empty Ti = {x\ < . . . < Xd}, compute P i = max{A x [i] \ x £ 
(xg, xi + i)} for every £ £ [1, d — 1], and then apply Fact |B.2j to the sequence 
A xi [i], fa, A X2 [i], . . . , Pd-i, A Xd [i], 

Lemma B.3. The preprocessing requires 0{ma{h)) time. 

Proof. We can examine all the entries of A z whose values differ from c z in 0(m z ) 
time. By examining all such entries of Ai, . . . , Ah, we can construct F, and arrange 
the integers in Ti in ascending order. Thus, Step 1 takes 0(m) time. Step 2 takes 
O(h) — 0(m) time. For Step 3, we first analyze the time required to process one 
nonempty Ti = {x\ < ... < Xd}. Note that (xg,xe + i) fl Tj = (j) for every I £ 
[l,d— 1]. Thus, Pi = max-fcz | x £ (xe,X£ + i)} can be computed in 0(a(n)) time 
using the result of Step 2. Su mmin g over all £ £ [l,d— 1], computing all Pe takes 
0(\Ti\a(h)) time. Applying Fact |B~^ to the sequence A Xl [i] , Pi, A X2 [i], . . . p d -i, A Xd [i] 
takes 0(|rj|) time. In total, it takes 0(\Ti\a(h)) time to process one T i: and Step 3 
takes 0(J2 Wi\ct(h)) = 0{ma{h)) time. Thus, the total time of our preprocessing is 
0(ma(h)). □ 

After the preprocessing, each query can be answered in O (log h) time as stated 
in the following lemma. 

Lemma B.4. After the preprocessing, max{A 2 [i] | z £ H} can be found in 
O(logn) time for any i £ [1, b] and any interval H C [1, h]. 

Proof. Let H = [p,q]. A crucial step is to find [p, q] Ti. Without loss of 
generality, assume Ti ^= <f>. To find [p, q] H Tj, we first find the smallest integer x s in 
Ti that is greater than p, and the largest integer xt in Ti that is smaller than q. Since 
r,; is sorted, we can find x s and Xt in 0(log | ) = 0(\ogh) time. If x s > xt, then 
[p, q] H Ti — <f>; otherwise, [p, q] n Ti is the set of integers between x s and Xt in Tj. 

If [p, q] n r.; = tfi, then maxlAr^] | x £ [p, q]} — maxjca- | x £ [p, q]}. Because of 
Step 1 of our preprocessing, we can find max{c K | x € [p, q]} in 0(a{h)) time. 

If [p, q] n r 4 = {x s < x s+ i < . . . < x t }, then [p, q] = [p, x s - 1] U {z s } U (x s ,x s+ i) 
U • • ■ U {x t } U [xt + 1, q] and max{A z [i] | z £ [p, q]} equals the maximum of 

1. max-tA^i] \ x £ [p, x s - 1]}, 

2. max{A4i] | x £ {x s } U (a; s ,a; s+ i) U ■ ■ • U (a;t_i,a; t ) U {x t }}, 

3. maxjAj;^] \ x £ [xt + l,?]}- 

21 



Note that Item 2 equals the maximum of A Xe [i], (3 a , . . . , (3 t -i, A Xt [i], which can be 
computed in 0(a(h)) time after Step 3 of our preprocessing. Since I\ fl [p, x s — 1] = <j> 
and (~l [xt + 1, q] = (f>, Step 2 enables us to compute Items 1 and 3 in 0(a(h j) time. 
As a result, max{^4 z [i] | z € \p,q]} can be answered in 0(logh) time. □ 
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